Chromatic number, clique subdivisions, and the conjectures of Hajós and Erdős-Fajtlowicz

Abstract

For a graph $G$, let $\chi(G)$ denote its chromatic number and $\sigma(G)$ denote the order of the largest
clique subdivision in $G$. Let $H(n)$ be the maximum of $\chi(G)/\sigma(G)$ over all $n$-vertex graphs $G$.
A famous conjecture of Hajós from 1961 states that $\sigma(G) \ge \chi(G)$ for every graph $G$. That is,
$H(n) \le 1$ for all positive integers $n$. This conjecture was disproved by Catlin in 1979. Erdős
and Fajtlowicz further showed by considering a random graph that $H(n) \ge cn^{1/2}/\log n$ for some
absolute constant $c > 0$. In 1981 they conjectured that this bound is tight up to a constant factor
in that there is some absolute constant $C$ such that $\chi(G)/\sigma(G) \le Cn^{1/2}/\log n$ for all $n$-vertex
graphs $G$. In this paper we prove the Erdős-Fajtlowicz conjecture. The main ingredient in our
proof, which might be of independent interest, is an estimate on the order of the largest clique
subdivision which one can find in every graph on $n$ vertices with independence number $\alpha$.

1 Introduction

A subdivision of a graph $H$ is any graph formed by replacing edges of $H$ by internally vertex-disjoint
paths. This is an important notion in graph theory; for example, the celebrated theorem of Kuratowski uses
it to characterize planar graphs. For a graph $G$, we let $\sigma(G)$ denote the largest integer $p$ such that
$G$ contains a subdivision of a complete graph of order $p$. Clique subdivisions in graphs have been
extensively studied and there are many results which give sufficient conditions for a graph $G$ to have
large $\sigma(G)$. For example, Bollobás and Thomason [5], and Komlós and Szemerédi [13] independently
proved that every graph of average degree at least $d$ has $\sigma(G) \ge cd^{1/2}$ for some absolute constant
$c$. Motivated by a conjecture of Erdős, in [2] the authors further showed that when $d = \Omega(n)$ in
the above subdivision one can choose all paths to have length two. A similar result for subdivisions of
general graphs with $O(n)$ edges (a clique of order $O(\sqrt{n})$ clearly satisfies this) was obtained in [10].

For a given graph $G$, let $\chi(G)$ denote its chromatic number. A famous conjecture made by Hajós in
1961 states that $\sigma(G) \ge \chi(G)$. Dirac [7] proved that this conjecture is true for all $\chi(G) \le 4$, but in
1979, Catlin [6] disproved the conjecture for all $\chi(G) \ge 7$. Subsequently, several researchers further
studied this problem. On the negative side, by considering random graphs, Erdős and Fajtlowicz
[8] in 1981 showed that the conjecture actually fails for almost all graphs. On the positive side,
recently Kühn and Osthus [16] proved that all graphs of girth at least 186 satisfy Hajós' conjecture.
Thomassen [19] studied the relation of Hajós' conjecture to several other problems of graph theory
such as Ramsey theory, maximum cut problem, etc., and discovered many interesting connections.

In this paper, we revisit Hajós' conjecture and study to what extent the chromatic number of a
graph can exceed the order of its largest clique subdivision. Let $H(n)$ denote the maximum of
$\chi(G)/\sigma(G)$ over all $n$-vertex graphs $G$. The example of graphs given by Erdős and Fajtlowicz which
disprove Hajós' conjecture in fact has $\sigma(G) = \Theta(n^{1/2})$ and $\chi(G) = \Theta(n/\log n)$. Thus it implies that
$H(n) = \Omega(n^{1/2}/\log n)$. In [8], Erdős and Fajtlowicz conjectured that this bound is tight up to a
constant factor so that $H(n) = O(n^{1/2}/\log n)$. Our first theorem verifies this conjecture.

Theorem 1.1 There exists an absolute constant $C$ such that $H(n) \le Cn^{1/2}/\log n$ for $n \ge 2$.

The proof shows that we may take $C = 10^{120}$, although we do not try to optimize this constant.
For the random graph $G = G(n,p)$ with $0 < p < 1$ fixed, Bollobás and Catlin [4] determined
$\sigma(G)$ asymptotically almost surely and later Bollobás [3] determined $\chi(G)$ asymptotically almost
surely. These results imply, by picking the optimal choice $p = 1 - e^{-2}$, the lower bound $H(n) \ge
\left(\frac{1}{e\sqrt{2}} - o(1)\right)n^{1/2}/\log n$.
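
One way to see where the optimal $p$ and the resulting constant come from is the following computation. It is only a sketch: it assumes the asymptotic forms $\sigma(G(n,p)) = (1+o(1))\sqrt{2n/(1-p)}$ and $\chi(G(n,p)) = (1+o(1))\,n\log(1/(1-p))/(2\log n)$ for the results of [4] and [3], which are not restated here, and it writes $q = 1-p$.

```latex
% Assumed asymptotics (a.a.s.), with q = 1 - p fixed:
%   sigma(G) ~ sqrt(2n/q),   chi(G) ~ n log(1/q) / (2 log n).
\frac{\chi(G)}{\sigma(G)}
  = (1+o(1))\,\frac{n\log(1/q)}{2\log n}\cdot\sqrt{\frac{q}{2n}}
  = (1+o(1))\,\frac{\sqrt{q}\,\log(1/q)}{2\sqrt{2}}\cdot\frac{n^{1/2}}{\log n}.
% Maximize g(q) = sqrt(q) log(1/q) over 0 < q < 1:
g'(q) = \frac{\log(1/q) - 2}{2\sqrt{q}} = 0
  \iff q = e^{-2}, \qquad g(e^{-2}) = \frac{2}{e},
% so the best constant obtainable this way is
\frac{g(e^{-2})}{2\sqrt{2}} = \frac{1}{e\sqrt{2}}.
```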

For a graph $G$, let $\alpha(G)$ denote its independence number. Theorem 1.1 actually follows from the
study of the relation between $\sigma(G)$ and $\alpha(G)$, which might be of independent interest. Let $f(n,\alpha)$
be the minimum of $\sigma(G)$ over all graphs $G$ on $n$ vertices with $\alpha(G) \le \alpha$.

Theorem 1.2 There exist absolute positive constants $c_1$ and $c_2$ such that the following holds.

1. If $\alpha < 2\log n$, then $f(n,\alpha) \ge c_1 n^{\frac{\alpha}{2\alpha-1}}$, and

2. if $\alpha = a\log n$ for some $a \ge 2$, then $f(n,\alpha) \ge c_2\sqrt{\frac{n}{a\log a}}$.

Note that for $\alpha = 2\log n$, both bounds from the first and second part give $f(n,\alpha) \ge \Omega(\sqrt{n})$.
Moreover, both parts of this theorem establish the correct order of magnitude of $f(n,\alpha)$ for some
range of $\alpha$. For $\alpha = 2$, it can be shown that in the triangle-free graph constructed by Alon [1], every
set of size at least $37n^{2/3}$ contains at least $n$ edges. This implies that the complement of this graph
has independence number 2 and the largest clique subdivision of size $t < 37n^{2/3}$. Indeed, if there is
a clique subdivision of order $t \ge 37n^{2/3}$, then between each of the at least $n$ pairs of nonadjacent
vertices among the $t$ vertices of the subdivided clique, there is at least one additional vertex along
the path between them in the subdivision. However, this would require at least $t + n$ vertices in the
$n$-vertex graph, a contradiction. On the other hand, for $\alpha = \Theta(\log n)$, by considering $G(n,p)$ with
constant $0 < p < 1$, one can see that the second part of Theorem 1.2 is tight up to the constant
factor. Even for $\alpha = o(\log n)$, by considering the complement of $G(n,p)$ for suitable $p \ll 1$, one can
easily verify that there exists an absolute constant $c'$ such that $f(n,\alpha) \le O(n^{\frac{1}{2}+\frac{c'}{\alpha}})$.

Theorem 1.2 can also be viewed as a Ramsey-type theorem which establishes an upper bound on the
Ramsey number of a clique subdivision versus an independent set.



Notation. A graph $G = (V, E)$ is given by a pair of its vertex set $V = V(G)$ and edge set $E = E(G)$.
The edge density of $G$ is the ratio $|E|/\binom{|V|}{2}$. For a subset $X$ of vertices, we use $G[X]$ to denote the
induced subgraph of G on the set X. Throughout the paper log denotes the natural logarithm. 
We systematically omit floor and ceiling signs whenever they are not crucial, for the sake of clarity 
of presentation. We also do not make any serious attempt to optimize absolute constants in our 
statements and proofs. 

2 Deducing Theorem 1.1 from Theorem 1.2

Theorem 1.1 is a quick corollary of Theorem 1.2. To see this, let $C = \max\left(e^8, \frac{16}{ec_1}, \frac{4}{c_2\sqrt{e}}\right)$, where
$c_1, c_2$ are the constants from Theorem 1.2. We will prove Theorem 1.1 by induction on $n$. Suppose
we want to prove the claim for $n$, and are given a graph $G$ on $n$ vertices with $\chi(G) = k$. We may
assume that $k \ge C$ (and thus $n \ge C$) as otherwise the claim is trivially true. If $\alpha := \alpha(G) \le 4n/k$,
then the bounds in Theorem 1.2 easily give us the desired bound. Indeed, consider the two cases.
Let $a = \alpha/\log n$. If also $\alpha \le 2\log n$, so that $a \le 2$, we can use the first part of Theorem 1.2 to get
where we used the fact that the minimum of $ae^{1/(4a)}$ in the domain $(0, 2]$ occurs at $a = 1/4$. If
$\alpha(G) > 2\log n$, so that $a > 2$, then by using the second part of Theorem 1.2 we get

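\[
\frac{\chi(G)}{\sigma(G)} \le \frac{\sqrt{a\log a}\cdot k}{c_2\sqrt{n}} \le \frac{\sqrt{a\log a}\cdot 4n/(a\log n)}{c_2\sqrt{n}} = \frac{n^{1/2}}{\log n}\cdot\frac{4\sqrt{\log a}}{c_2\sqrt{a}} \le \frac{n^{1/2}}{\log n}\cdot\frac{4}{c_2\sqrt{e}} \le C\,\frac{n^{1/2}}{\log n},
\]

where we used the fact that the maximum of $\frac{\log a}{a}$ occurs at $a = e$.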
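
For the first case ($\alpha \le 2\log n$, so $a \le 2$), a chain of inequalities consistent with the remark about the minimum at $a = 1/4$ can be sketched as follows. The intermediate steps are a reconstruction from the first part of Theorem 1.2 and the assumption $\alpha \le 4n/k$, not a quotation of the original display.

```latex
\frac{\chi(G)}{\sigma(G)}
  \le \frac{k}{c_1 n^{\alpha/(2\alpha-1)}}
  \le \frac{4n/(a\log n)}{c_1 n^{1/2 + 1/(4\alpha)}}
  = \frac{n^{1/2}}{\log n}\cdot\frac{4}{c_1\,a\,e^{1/(4a)}}
  \le \frac{16}{e\,c_1}\cdot\frac{n^{1/2}}{\log n}.
% Steps used:
%   k <= 4n/alpha = 4n/(a log n);
%   alpha/(2 alpha - 1) = 1/2 + 1/(2(2 alpha - 1)) >= 1/2 + 1/(4 alpha);
%   n^{1/(4 alpha)} = exp(log n / (4 a log n)) = e^{1/(4a)};
%   a e^{1/(4a)} >= e/4 for all a > 0, with equality at a = 1/4.
```

In particular, any constant $C \ge 16/(ec_1)$ suffices for this case.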

Otherwise, $\alpha(G) > 4n/k$. By deleting a maximum independent set, we get an induced subgraph $G'$
on $n' \le n - 4n/k$ vertices, with chromatic number at least $k - 1$, and a clique subdivision of size at
least $\sigma(G') \ge \chi(G')/H(n') \ge (k-1)/H(n')$. Note that if $n' < e^2$, then $k \le 1 + n' < 9 < C$, and this
case was already settled. So we may assume $n' \ge e^2$. Hence, by induction on $n$, we have

\[
\frac{\chi(G)}{\sigma(G)} \le \frac{k}{\sigma(G')} \le \left(\frac{k}{k-1}\right) H(n') \le \frac{k}{k-1}\cdot\frac{C\,(n')^{1/2}}{\log n'}. \tag{1}
\]

As the function $x^{1/2}/\log x$ is increasing for $x \ge e^2$, the right-hand side of (1) is maximized at
$n' = (1 - 4/k)n$. Consequently, we have
where in the second to last inequality we used

\[
\log(1-x) = -\left(x + \frac{x^2}{2} + \frac{x^3}{3} + \cdots\right) \ge -\left(x + x^2 + x^3 + \cdots\right) \ge -2x
\]

for $0 < x \le 1/2$, which holds with $x = 4/k$ from $k \ge C \ge e^8$, and in the last inequality we used
$n \ge C \ge e^8$. We therefore have $H(n) \le Cn^{1/2}/\log n$, which completes the proof. □
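
The display that the remarks about the second to last and last inequalities refer to can be sketched as follows. This is one chain consistent with those remarks, assuming $k \ge 8$ (so that $x = 4/k \le 1/2$) and $\log n \ge 8$; the exact intermediate terms in the original may differ.

```latex
\frac{\chi(G)}{\sigma(G)}
  \le \frac{k}{k-1}\cdot\frac{C\,(1-4/k)^{1/2}\,n^{1/2}}{\log n + \log(1-4/k)}
  \le \frac{k}{k-1}\Big(1-\frac{2}{k}\Big)\frac{C\,n^{1/2}}{\log n + \log(1-4/k)}
  \le \frac{k-2}{k-1}\cdot\frac{C\,n^{1/2}}{\log n - 8/k}
  \le \frac{C\,n^{1/2}}{\log n}.
% 1st step: inequality (1) evaluated at n' = (1 - 4/k) n.
% 2nd step: (1 - 4/k)^{1/2} <= 1 - 2/k.
% 3rd step: (k/(k-1))(1 - 2/k) = (k-2)/(k-1), and log(1 - x) >= -2x with x = 4/k.
% 4th step: (k-2)/(k-1) = 1 - 1/(k-1) <= 1 - 8/(k log n) whenever log n >= 8.
```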

3 Tools and the idea of the proof 

The proof of Theorem 1.2 makes use of four main tools that we describe in this section. At the end
of the section we outline the proof of Theorem 1.2 using these tools.

Our first tool is a theorem independently proved by Bollobás and Thomason [5], and Komlós and
Szemerédi [13]. They determined up to a constant factor the minimum number of edges which
guarantees a $K_t$-subdivision in a graph on $n$ vertices, solving an old conjecture made by Erdős and
Hajnal, and also by Mader.

Theorem 3.1 (Bollobás-Thomason, Komlós-Szemerédi) Every graph $G$ with $n$ vertices and
at least $256t^2n$ edges satisfies $\sigma(G) \ge t$.

We remark that Theorem 3.1 implies $H(n) = O(n^{1/2})$. Indeed, a graph $G$ with chromatic number $k$
has a subgraph with minimum degree at least $k-1$ and hence by Theorem 3.1 satisfies $\sigma(G) = \Omega(k^{1/2})$.
We thus get $\frac{\chi(G)}{\sigma(G)} = O(k^{1/2}) = O(n^{1/2})$. As this bound holds for all $G$ on $n$ vertices, we have
$H(n) = O(n^{1/2})$. Our goal is to prove the better bound $H(n) = O(n^{1/2}/\log n)$.
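
The remark can be made explicit as follows. The constant $512$ below is simply what the edge condition of Theorem 3.1 yields and is not optimized, and floors are ignored as elsewhere in the paper; the names $G'$ and $m$ are introduced here for illustration.

```latex
% G' \subseteq G: a subgraph on m vertices with minimum degree at least k-1, where k = \chi(G) >= 2.
e(G') \ge \frac{(k-1)m}{2} = 256\,t^2 m \quad\text{for}\quad t = \sqrt{\frac{k-1}{512}},
% so Theorem 3.1 applied to G' gives
\sigma(G) \ge \sigma(G') \ge \sqrt{\frac{k-1}{512}},
\qquad
\frac{\chi(G)}{\sigma(G)} \le k\sqrt{\frac{512}{k-1}} = O(k^{1/2}) = O(n^{1/2}).
```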

The theorem above can be used as a black box to indirectly construct clique subdivisions in certain 
cases. However, in order to directly construct a large clique subdivision in a graph, we first find a 
large subset in which only a small number of edges is missing, and then for each such missing edge, 
find internally vertex-disjoint paths connecting the two endpoints. For technical reasons, we reverse 
the two steps. That is, we first find a large subset of vertices such that every pair of vertices can 
be connected by many internally vertex-disjoint paths (Lemma 3.1), and then find a further subset
in which only a small proportion of edges is missing (Lemma 3.2). It would be nice if these two
steps were sufficient in proving Theorem 1.1. Unfortunately, a naive application of these two steps
together with Theorem 3.1 will only imply our main result for a certain range of parameters, more
precisely, when the graph is dense enough depending on the independence number. Thus to handle
the case when the graph is sparse, we develop another lemma (Lemma 3.3), which essentially says
that in the sparse case, we can find a subgraph in which the parameters work (see the discussion
before Lemma 3.3).



The next tool is based on a simple yet surprisingly powerful lemma whose proof uses a probabilistic 
argument known as dependent random choice. Early versions of this technique were developed in 
the papers [12, 14, 17]. Later, variants were discovered and applied to various problems in Ramsey
theory and extremal graph theory (see the survey [11] for more details). The following lemma says
that every graph of large enough density contains a large subset in which every pair of vertices is
connected by many internally vertex-disjoint paths.



Lemma 3.1 Assume that $d$ and $n$ are given so that $d^2n \ge 1600$. If $G = (V,E)$ has $n$ vertices and
edge density $d$, then there is a vertex subset $U \subset V$ with $|U| \ge dn/50$ vertices such that every pair
of vertices in $U$ has at least $10^{-10}d^5n$ internally vertex-disjoint paths of length 4 which use only
vertices from $V \setminus U$ as internal vertices.

Proof: Let $V_1$ be a random subset of $V$ of size $\lceil n/2 \rceil$ and let $V_2 = V \setminus V_1$. Then it is easy to check that
$\mathbb{E}[e(V_1, V_2)] \ge \frac{d}{2}\binom{n}{2}$ and therefore we may pick such a partition $V = V_1 \cup V_2$ with $e(V_1, V_2) \ge \frac{d}{2}\binom{n}{2}$.
Throughout the proof we restrict our graph to the bipartite graph induced by the edges between $V_1$
and $V_2$.

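Pick a vertex $v_0 \in V_2$ uniformly at random, and let $X \subset V_1$ be the neighborhood of $v_0$. The
probability of a fixed vertex $v \in V_1$ belonging to $X$ is $\mathbb{P}(v \in X) = \deg(v)/|V_2|$ and thus by the
Cauchy-Schwarz inequality and $n \ge 2$,

\[
\mathbb{E}[|X|^2] \ge \mathbb{E}[|X|]^2 = \left(\frac{e(V_1,V_2)}{|V_2|}\right)^2 \ge \left(\frac{1}{|V_2|}\cdot\frac{d}{2}\binom{n}{2}\right)^2 \ge \frac{d^2n^2}{16}.
\]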

Call a pair $(v, w)$ of vertices in $V_1$ bad if $v$ and $w$ have at most $d^2n/800$ common neighbors, and
call it good otherwise. Let $b$ be the number of bad pairs in $X$. Note that for a bad pair $(v,w)$, the
probability that both $v$ and $w$ belong to $X$ is at most $\mathbb{P}(v,w \in X) \le d^2n/(800|V_2|)$. Consequently,
the expectation of $b$ is at most

\[
\mathbb{E}[b] \le \frac{d^2n}{800|V_2|}\binom{n}{2} \le \frac{d^2n^2}{800}.
\]

Thus we have

\[
\mathbb{E}\left[|X|^2 - 40b\right] \ge \frac{d^2n^2}{80}.
\]

Therefore we have a choice of $v_0$ for which $|X|^2 - 40b \ge d^2n^2/80 > 0$. Fix this choice of $v_0$ (and $X$).
Note that this in particular implies $|X| \ge dn/10$ and $b \le |X|^2/40$.

Call a vertex in $X$ bad if it forms a bad pair with at least $|X|/4$ vertices in $X$. By the bound on
$b$, we know that there are at most $|X|/5$ bad vertices in $X$. Let $U$ be an arbitrary subcollection of
non-bad vertices in $X$ of size $|X|/5 \ge dn/50$. We claim that $U$ is a set which has all the claimed
properties.

Since each vertex of $U$ forms a bad pair with at most $|X|/4$ vertices in $X$, for every two distinct
vertices $v, w$ in $U$, the number of vertices in $X \setminus U$ with which both $v, w$ form a good pair is at least

\[
|X| - |U| - 2\cdot\frac{|X|}{4} \ge \frac{|X|}{10}.
\]

Moreover, whenever we have a vertex $x$ which forms a good pair with both $v$ and $w$, by the definition
of a good pair, we can find at least $(d^2n/800)(d^2n/800 - 1)$ paths of length 4 connecting $v$ and $w$
which use only vertices from $V \setminus U$ as internal vertices. Therefore by collecting the facts, we see
that given $d^2n \ge 1600$, the number of such paths of length 4 between $v$ and $w$ is at least

\[
\left(\frac{|X|}{10}\right)\cdot\left(\frac{d^2n}{800}\right)\left(\frac{d^2n}{800} - 1\right) \ge \frac{d^5n^3}{100\cdot 800\cdot 1600} \ge 10^{-9}d^5n^3.
\]



Note that the interior (without endpoints) of any given path of length 4 connecting $v$ and $w$ can intersect
at most $3n^2$ other such paths. This implies that there are at least $10^{-9}d^5n^3/(3n^2) \ge 10^{-10}d^5n$
internally vertex-disjoint paths connecting $v$ and $w$ which use only vertices from $V \setminus U$ as internal
vertices. This completes the proof. □
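
The numerical constants in this proof can be checked directly. The following lines only spell out arithmetic that the argument relies on, under the assumption $d^2n \ge 1600$ and with the bounds $\mathbb{E}[|X|^2] \ge d^2n^2/16$ and $\mathbb{E}[b] \le d^2n^2/800$.

```latex
\mathbb{E}\big[|X|^2 - 40b\big]
  \ge \frac{d^2n^2}{16} - 40\cdot\frac{d^2n^2}{800} = \frac{d^2n^2}{80},
\qquad\text{hence}\qquad
|X| \ge \frac{dn}{\sqrt{80}} \ge \frac{dn}{10}, \quad b \le \frac{|X|^2}{40}.
% Each bad vertex lies in at least |X|/4 bad pairs and each bad pair has two endpoints:
\#\{\text{bad vertices}\} \le \frac{2b}{|X|/4} \le \frac{8}{|X|}\cdot\frac{|X|^2}{40} = \frac{|X|}{5}.
% d^2 n >= 1600 gives d^2 n/800 - 1 >= d^2 n/1600, so
\frac{|X|}{10}\cdot\frac{d^2n}{800}\Big(\frac{d^2n}{800}-1\Big)
  \ge \frac{dn}{100}\cdot\frac{d^2n}{800}\cdot\frac{d^2n}{1600}
  = \frac{d^5n^3}{1.28\times 10^{8}} \ge 10^{-9}d^5n^3,
\qquad
\frac{10^{-9}d^5n^3}{3n^2} \ge 10^{-10}d^5n.
```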

The following lemma asserts that every graph of small independence number contains a large subset 
in which only a small proportion of edges are missing. 

Lemma 3.2 Let $0 < \rho < 1$ and $\alpha$ be a positive integer. Then for every positive integer $s \le \lceil \rho^{\alpha-1}n \rceil$,
every graph $G$ on $n$ vertices with independence number at most $\alpha$ contains a subset of size $s$ with at
most $\rho s^2$ nonadjacent pairs of vertices.

Proof: If $s = 1$, then the claim is clearly true. Thus we assume that $s \ge 2$. Let $t$ be an integer
satisfying $t \ge s$. It suffices to find a subset of order $t$ which has at most $\rho t^2/2$ nonadjacent pairs,
since by an averaging argument over all subsets of this $t$-set of order $s$, we can find a subset of order
$s$ which has at most

\[
\frac{\rho t^2}{2}\cdot\frac{\binom{s}{2}}{\binom{t}{2}} = \frac{\rho}{2}\cdot\frac{t}{t-1}\cdot s(s-1) \le \rho s^2
\]

edges missing.

Let $V_0 = V(G)$. We will find a sequence $V_0 \supset V_1 \supset \cdots$ of subsets such that the induced subgraph of
$G$ with vertex set $V_i$ has independence number at most $\alpha - i$ and at least $\rho^i n$ vertices. Notice this is
satisfied for $i = 0$. If $V_i$ has a vertex which has at least $\rho|V_i|$ non-neighbors in $V_i$, then let $V_{i+1} \subset V_i$
be the subset of non-neighbors, so $|V_{i+1}| \ge \rho|V_i|$. Since the induced subgraph of $G$ with vertex set
$V_{i+1}$ has independence number at most $\alpha - i - 1$, we can continue the induction. Otherwise, every
vertex of $V_i$ has fewer than $\rho|V_i|$ non-neighbors, so there are fewer than $\rho|V_i|^2/2$ nonadjacent pairs in $V_i$,
in which case we are done. If this process continues through $\alpha - 1$ steps, we get a set $V_{\alpha-1}$ of order
at least $\rho^{\alpha-1}n$, and independence number at most one, so this is a clique of order at least $s$, which
completes the proof. □

Suppose we are trying to prove Theorem 1.2 for $\alpha < 2\log n$. First apply Lemma 3.1 to find a
subset $U$ of size $\Omega(dn)$ in which each pair is connected by $\Omega(d^5n)$ internally vertex-disjoint paths of
length 4. Then apply Lemma 3.2 to $U$ with a suitable choice of $\rho$, and hope to find a subset of size
$\Omega(n^{\alpha/(2\alpha-1)})$, in which $O(d^5n)$ edges are missing. By Lemma 3.1, we can use internally vertex-disjoint
paths of length 4 instead of missing edges to get a clique subdivision on these vertices. A crucial
observation is that this only works if $U$ is large enough (that is, if $d$ is large enough).

Our next lemma can be used to overcome this difficulty. The idea of this lemma first appeared in a
1972 paper of Erdős-Szemerédi [9], and has also been useful in other problems (for example, [18]). It
shows that if a sparse graph does not have large independence number, then it contains an induced
subgraph with many vertices whose independence number is much smaller than in the original graph.
We will later see that with the help of this lemma, the strategy above can be modified to find a subset
of size $\Omega(n^{\alpha/(2\alpha-1)})$ with $O(d^5n)$ non-adjacent pairs (we use the same strategy for $\alpha \ge 2\log n$).



Lemma 3.3 Let $0 < d < 1$. Let $G$ be a graph, $I$ be a maximum independent set of $G$ with $|I| = \alpha$,
and $V_1$ be a vertex subset of $V \setminus I$ with $|V_1| = N$ such that each vertex in $V_1$ has at most $d|I|$ neighbors
in $I$. Then there is a subset $U \subset V_1$ with $|U| \ge \left(\frac{d}{e}\right)^{d\alpha}N$ such that the induced subgraph of $G$ with
vertex set $U$ has independence number at most $d\alpha$.

Proof: For every vertex $v \in V_1$ fix a subset of $I$ of size $\lfloor d|I| \rfloor$ which contains all neighbors of
$v$ in $I$. Since the number of such subsets of $I$ is at most $\binom{|I|}{\lfloor d|I| \rfloor} \le \left(\frac{e}{d}\right)^{d|I|}$, one of them contains
neighborhoods of at least $|V_1|\left(\frac{d}{e}\right)^{d|I|}$ vertices of $V_1$. Let $U \subset V_1$ be the set of these vertices. We have
that $|U| \ge \left(\frac{d}{e}\right)^{d\alpha}N$, and the number of vertices in $I$ which have a neighbor in $U$ is at most $d|I|$.
Note that all the vertices in $I$ which do not have a neighbor in $U$ can be added to an independent
subset of $U$ to make a larger independent set. Since $I$ is an independent set of size $\alpha$, there are at
least $(1-d)\alpha$ such vertices. Moreover, since $G$ has independence number at most $\alpha$, the induced
subgraph of $G$ on $U$ has independence number at most $d\alpha$. □
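
The two estimates used in this proof can be written out as follows. This is a sketch that assumes the standard bound $\binom{\alpha}{m} \le (e\alpha/m)^m$ and writes $m = \lfloor d\alpha \rfloor \ge 1$; the symbols $I_0$ and $J$ are introduced here only for illustration.

```latex
% x -> (e\alpha/x)^x is increasing on (0, \alpha], and m <= d\alpha, so
\binom{\alpha}{m} \le \Big(\frac{e\alpha}{m}\Big)^{m} \le \Big(\frac{e}{d}\Big)^{d\alpha},
\qquad\text{and by pigeonhole}\qquad
|U| \ge \frac{N}{\binom{\alpha}{m}} \ge \Big(\frac{d}{e}\Big)^{d\alpha} N.
% Let I_0 \subseteq I be the vertices of I with no neighbor in U, so |I_0| >= (1-d)\alpha.
% For any independent set J \subseteq U, the set J \cup I_0 is independent in G, hence
|J| + (1-d)\alpha \le |J| + |I_0| \le \alpha
\quad\Longrightarrow\quad |J| \le d\alpha.
```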

3.1 Outline of the proof 

We next outline the proof of Theorem 1.2, which gives a lower bound on $\sigma(G)$ for a graph $G$ with $n$
vertices and independence number $\alpha$. The proof strategy depends on whether or not the graph $G$ is
dense.

When $G$ is dense the proof splits into two cases, depending on the size of $\alpha$ (see Lemma 4.1 in the
next section). If $\alpha \le 2\log n$, then we apply Lemma 3.1 to obtain a large vertex subset $U$ in which
every pair of vertices in $U$ are the endpoints of a large number of internally vertex-disjoint paths of
length 4. We then apply Lemma 3.2 to obtain a subset $S \subset U$ of large order $s$ such that $G[S]$ has
few missing edges. The vertices of $S$ form the vertices of a $K_s$-subdivision. Indeed, for every pair of
adjacent vertices in $S$, we use the edges between them as paths, and for every pair of non-adjacent
vertices, we use paths of length 4 between them. These paths can be chosen greedily using that each
pair of vertices in $S$ are the endpoints of many internally vertex-disjoint paths of length 4 and there
are few missing edges within $S$. This completes the case $\alpha \le 2\log n$ of Lemma 4.1. If $\alpha > 2\log n$,
using the fact that $G$ is dense, we apply Theorem 3.1 to obtain the desired large clique subdivision.

For sparse $G$ we prove a lower bound on $\sigma(G)$ in terms of the number $n$ of vertices, the independence
number $\alpha = \alpha(G)$, and the edge density $d$ of $G$ (see Lemma 4.2 in the next section). The proof is by
induction on $n$, the base case $n = 1$ being trivial. The cases $d \le n^{-1/6}$ or $\alpha \ge n/16$ can be trivially
verified, so we may suppose $d > n^{-1/6}$ and $\alpha < n/16$. One easily finds an independent set $I$ and a
vertex subset $V''$ which is disjoint from $I$ with $|V''| \ge n/8$ such that $I$ is a maximum independent
set in $G[V'' \cup I]$ and every vertex in $V''$ has at most $8d|I|$ neighbors in $I$. If $G[V'']$ has edge
density at most $d/10$, then, by the induction hypothesis, $G[V'']$, and hence $G$ as well, contains a
$K_s$-subdivision of the desired size. So we may suppose $G[V'']$ has edge density at least $d/10$. Apply
Lemma 3.1 to find a large subset $V_1 \subset V''$ such that every pair of vertices in $V_1$ are the endpoints of
a large number of internally vertex-disjoint paths of length 4. Applying Lemma 3.3, we find a large
subset $U \subset V_1$ such that the independence number of $G[U]$ is small. Finally, we apply Lemma 3.2
to obtain a subset $S \subset U$ of large order $s$ such that $G[S]$ has few missing edges. Just as in the
dense case discussed above, the set $S$ forms the vertices of the desired $K_s$-subdivision.

4 Proof of Theorem 1.2

In this section we prove Theorem 1.2 using the tools and the strategy we developed in the previous
section. We separately consider two cases depending on the relation between the edge density $d$ and
the independence number $\alpha$ of the graph. The following lemma establishes the case when the graph
is dense.

Lemma 4.1 Fix a constant $0 < c \le 1$. The following holds for every graph $G$ with $n \ge 10^{14}c^{-5}$
vertices, edge density $d$, and independence number $\alpha$.

(i) If $\alpha \le 2\log n$ and $d \ge c$, then $\sigma(G) \ge 10^{-6}c^{5/2}n^{\alpha/(2\alpha-1)}$.

(ii) If $\alpha = a\log n$ for some $a \ge 2$ and $d \ge c/(a\log a)$, then $\sigma(G) \ge \sqrt{\frac{c}{600}}\sqrt{\frac{n}{a\log a}}$.



Proof: (i) Given a graph $G$ as in the statement of the lemma, since $d^2n \ge 1600$, we can apply
Lemma 3.1 to get a vertex subset $U$ of size $dn/50$ such that every pair of vertices in $U$ has at least
$10^{-10}d^5n$ internally vertex-disjoint paths of length 4 between them whose internal vertices lie in $V \setminus U$.

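We may assume $\alpha \ge 2$, as otherwise $\alpha = 1$, $G$ is a clique, and $\sigma(G) = n$. By applying Lemma 3.2 to
$U$ with $\rho = (10^{-8}d^3n^{-1})^{1/(2\alpha-1)}$ (note that $\rho < 1$), we find a vertex subset $S \subset U$ of size

\[
s = \lceil \rho^{\alpha-1}|U| \rceil \ge (10^{-8}d^3n^{-1})^{\frac{\alpha-1}{2\alpha-1}}\cdot\frac{dn}{50} \ge 10^{-6}d^{5/2}n^{\alpha/(2\alpha-1)}
\]

with at most

\[
\rho s^2 \le 2\rho^{2\alpha-1}|U|^2 = 2\cdot 10^{-8}d^3n^{-1}\cdot(dn/50)^2 < 10^{-11}d^5n
\]

nonadjacent pairs, where we used the fact that $s^2 \le 2\rho^{2\alpha-2}|U|^2$, which follows from the inequality
$s \ge 10^{-6}d^{5/2}n^{\alpha/(2\alpha-1)} \ge 10^{-6}d^{5/2}n^{1/2} \ge 10$ (since $n \ge 10^{14}c^{-5}$).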

We claim that the vertices of $S$ form the vertices of a $K_s$-subdivision. For every pair of adjacent
vertices in $S$, we use the edges between them as paths, and for every pair of non-adjacent vertices,
we use paths of length 4 between them. Since the number of non-adjacent pairs of vertices is at most
$10^{-11}d^5n$, and each such pair has at least $10^{-10}d^5n \ge 3\cdot 10^{-11}d^5n$ internally vertex-disjoint paths of
length 4 connecting them which use only vertices from $V \setminus S$ as internal vertices, we can greedily
pick one path for each non-adjacent pair to construct a $K_s$-subdivision. Indeed, note that the use of
a certain path of length 4 can destroy at most 3 other such paths for each other non-adjacent pair
since they have disjoint interiors.
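
The parameters in part (i) fit together as follows. The sketch assumes $\rho^{2\alpha-1} = 10^{-8}d^3n^{-1}$ and $|U| = dn/50$, which are the values consistent with the bounds $10^{-6}d^{5/2}n^{\alpha/(2\alpha-1)}$ and $10^{-11}d^5n$ used in the proof.

```latex
% (\alpha-1)/(2\alpha-1) <= 1/2 and 10^{-8} d^3 <= 1, so
\rho^{\alpha-1}|U|
  = \big(10^{-8}d^3n^{-1}\big)^{\frac{\alpha-1}{2\alpha-1}}\cdot\frac{dn}{50}
  \ge \big(10^{-8}d^3\big)^{1/2}\, n^{-\frac{\alpha-1}{2\alpha-1}}\cdot\frac{dn}{50}
  = 2\cdot 10^{-6}\, d^{5/2}\, n^{\frac{\alpha}{2\alpha-1}},
% number of non-adjacent pairs in S:
2\rho^{2\alpha-1}|U|^2
  = 2\cdot 10^{-8}d^3n^{-1}\cdot\frac{d^2n^2}{2500}
  = 8\cdot 10^{-12}\, d^5 n < 10^{-11}d^5n,
% greedy step: each earlier choice blocks at most 3 paths of a given pair, so fewer than
3\cdot 10^{-11}d^5n < 10^{-10}d^5n
% of the paths available to that pair are ever blocked, and one always remains.
```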

(ii) Since $d \ge c/(a\log a)$, the total number of edges in the graph is

\[
d\binom{n}{2} \ge \frac{c(n-1)}{2a\log a}\,n.
\]

Therefore by Theorem 3.1, we can find a $K_s$-subdivision for $s$ satisfying

\[
s \ge \sqrt{\frac{c(n-1)}{512a\log a}} \ge \sqrt{\frac{c}{600}}\sqrt{\frac{n}{a\log a}}.
\]

Let $f(n,\alpha,d)$ be the maximum $t$ such that every graph $G$ on $n$ vertices with independence number
at most $\alpha$ and edge density $d$ contains a $K_t$-subdivision. First note that by Turán's theorem, we
have a lower bound on $d$ in terms of $\alpha$.

Proposition 4.1 Let $G$ be a graph with $n$ vertices, edge density $d$, and independence number $\alpha$. If
$\alpha \le n/2$, then $d \ge 1/(2\alpha)$.

Proof: By Turán's theorem and convexity of the function $g(x) = \binom{x}{2}$, we know $d\binom{n}{2} \ge \alpha\binom{n/\alpha}{2} =
n(n/\alpha - 1)/2$, from which it follows that $d \ge \frac{n-\alpha}{\alpha(n-1)} \ge \frac{1}{2\alpha}$. □

The next lemma establishes a good bound on $f(n,\alpha,d)$ when $d$ is small (depending on $\alpha$). This can
be used to handle the remaining case of the proof of Theorem 1.2.

Lemma 4.2 Let $G$ be a graph with $n$ vertices, edge density $d$, and independence number $\alpha$. If
$\alpha \le n/2$, $d \le 10^{-20}$, and $d\alpha\log(1/d) \le (\log n)/100$, then we have $f(n,\alpha,d) \ge (1/50)d^4n^{\frac{1}{2}+\frac{1}{40d\alpha}}$.

Proof: The proof is by induction on $n$. Note that by $\alpha \le n/2$ and Proposition 4.1, we have
$d \ge 1/(2\alpha)$. Thus we only need to consider the range $1/(2\alpha) \le d \le 1$.

We first verify some initial cases. Namely, we prove that if either $d \le n^{-1/6}$ or $\alpha \ge n/16$ holds, then
the bound $f(n,\alpha,d) \ge (1/50)d^4n^{\frac{1}{2}+\frac{1}{40d\alpha}}$ is true.

If $d \le n^{-1/6}$, then by the fact $d \ge 1/(2\alpha)$ we have

\[
(1/50)d^4n^{\frac{1}{2}+\frac{1}{40d\alpha}} \le n^{-2/3}n^{\frac{1}{2}+\frac{1}{20}} \le 1.
\]

Therefore the statement is true in this case by the trivial bound $f(n,\alpha,d) \ge 1$. From now on, we
may assume that $d > n^{-1/6}$, from which it follows that $n \ge d^{-6} \ge 10^{120}$.

If $\alpha \ge n/16$ and $d > n^{-1/6}$, then by applying Theorem 3.1 we can find a clique subdivision of order
at least

\[
\sqrt{\frac{d\binom{n}{2}}{256n}} \ge \sqrt{\frac{dn}{600}}.
\]

We claim that this is larger than $(1/50)d^4n^{\frac{1}{2}+\frac{1}{40d\alpha}}$. Indeed, it suffices to show that $(1/50)d^4n^{\frac{1}{40d\alpha}} \le
\sqrt{d/600}$, which is implied by

\[
d^{7/2}n^{\frac{1}{40d\alpha}} \le 2.
\]

In the range $n^{-1/6} \le d \le 1$ with $\alpha \ge n/16$, the left-hand side is an increasing function and hence
maximized at $d = 1$. When $d = 1$ we have $d^{7/2}n^{\frac{1}{40d\alpha}} \le n^{\frac{2}{5n}} \le 2$. This finishes the proof of the initial
cases.



Now assume that some $n$ is given and the lemma has been proved for all smaller values of $n$. By the
observations above, we may also assume that $d \ge 1/(2\alpha)$, $\alpha < n/16$, and $d > n^{-1/6}$, which implies
$n \ge 10^{120}$. Let $G$ be a graph on $n$ vertices with edge density $d$, and independence number at most $\alpha$.
Let $V' \subset V(G)$ be the set of vertices of degree at most $2dn$, so $|V'| \ge n/2$.

Let $I$ be a maximum independent set in the induced subgraph of $G$ with vertex set $V'$. Note that
$|I| \le \alpha$. Let $X \subset V'$ be the set of vertices with at least $8d|I|$ neighbors in $I$, and let $V'' = V' \setminus (X \cup I)$.
Then by counting the number of edges incident to vertices of $I$ in two different ways, we get

\[
|X|\cdot 8d|I| \le |I|\cdot 2dn,
\]

from which we get the bound $|X| \le n/4$. Therefore, we get $|V''| \ge |V'| - |X| - |I| \ge n/8$. Let
$n' = |V''| \ge n/8$, and let $d'$ be the edge density and $\alpha'$ be the independence number, respectively, of
the graph $G[V'']$. Note that $\alpha' \le \alpha$.

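Case 1: $d' \le d/10$.

We want to apply the inductive hypothesis in this case. We first show that the new parameters
$n', d', \alpha'$ satisfy all the imposed conditions. First we have $\alpha' \le \alpha \le n/16 \le n'/2$, and second
$d' \le d \le 10^{-20}$. Finally, since $t\log(1/t)$ is increasing for $t \le e^{-1}$,

\[
d'\alpha'\log(1/d') \le \frac{d}{10}\,\alpha\log(10/d) \le \frac{d\alpha\log(1/d)}{2} \le \frac{\log n}{200} \le \frac{\log(n')}{100}.
\]

Thus we can use the induction bound $\sigma(G) \ge f(n',\alpha',d') \ge \frac{1}{50}(d')^4(n')^{\frac{1}{2}+\frac{1}{40d'\alpha'}}$. Let $d' = qd$, so
$q \le 1/10$. As $\alpha' \le \alpha$, note that

\[
f(n',\alpha',d') \ge \frac{1}{50}(d')^4(n')^{\frac{1}{2}+\frac{1}{40d'\alpha'}} \ge \frac{1}{50}\left(d^4n^{\frac{1}{2}+\frac{1}{40d\alpha}}\right)\left(q^4n^{\frac{1}{40d\alpha}\left(\frac{1}{q}-1\right)}/8\right)
= \frac{1}{400}\left(d^4n^{\frac{1}{2}+\frac{1}{40d\alpha}}\right)\left(e^{\frac{\log n}{40d\alpha}\left(\frac{1}{q}-1\right)-4\log\left(\frac{1}{q}\right)}\right).
\]

Since $40d\alpha \le d\alpha\log(1/d) \le \log n/100$, we have that $\frac{\log n}{40d\alpha}\left(\frac{1}{q}-1\right) - 4\log\left(\frac{1}{q}\right) \ge 100\left(\frac{1}{q}-1\right) - 4\log\left(\frac{1}{q}\right)$
and the right-hand side of the above displayed inequality is minimized when $q$ is maximized. By
using $q \le 1/10$ we see that

\[
\sigma(G) \ge f(n',\alpha',d') \ge d^4n^{\frac{1}{2}+\frac{1}{40d\alpha}}.
\]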
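
The final step of Case 1 can be checked numerically. Writing $h(q) = 100(1/q - 1) - 4\log(1/q)$ (a name introduced here for illustration), the following sketch verifies that the factor multiplying $d^4n^{\frac{1}{2}+\frac{1}{40d\alpha}}$ is at least $1$ for every $0 < q \le 1/10$.

```latex
h'(q) = -\frac{100}{q^2} + \frac{4}{q} = \frac{4q - 100}{q^2} < 0 \quad (0 < q \le 1/10),
% so h is decreasing, and on (0, 1/10] it is minimized at q = 1/10:
h(q) \ge h(1/10) = 900 - 4\log 10 > 890,
\qquad
\frac{1}{400}\,e^{h(q)} \ge \frac{e^{890}}{400} > 1.
```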

Case 2: $d' > d/10$.

Note that by the inequalities $d > n^{-1/6}$ and $n \ge 10^{120}$, we have

\[
(d')^2n' \ge d^2n/800 \ge n^{2/3}/800 \ge 1600.
\]

Therefore we can apply Lemma 3.1 to $G[V'']$ and find a subset $V_1 \subset V''$ with $|V_1| \ge d'n'/50 \ge dn/4000$
and the property that every pair of vertices in $V_1$ has at least $10^{-10}(d')^5n' \ge 10^{-16}d^5n$ internally
vertex-disjoint paths of length 4 between them, which only use vertices in $V'' \setminus V_1$. Then by Lemma
3.3 we get a subset $U \subset V_1$ with (note that $d \le 10^{-20}$ and $d\alpha \ge 1/2$)

\[
|U| \ge \left(\frac{e}{8d}\right)^{-8d\alpha}|V_1| \ge \left(\frac{e}{8d}\right)^{-8d\alpha}\frac{dn}{4000} \ge e^{-15d\alpha\log(1/d)}n = d^{15d\alpha}n
\]

such that the induced subgraph of $G$ with vertex set $U$ has independence number at most $\beta :=
\lfloor 8d\alpha \rfloor \ge 4$. Redefine $U$ as an arbitrary subset of size $u = \lceil e^{-15d\alpha\log(1/d)}n \rceil$. Note that $u \ge
e^{-15d\alpha\log(1/d)}n \ge n^{1/2}$ since $d\alpha\log(1/d) \le (\log n)/100$.

Let $\rho = (d^{6-30d\alpha}n^{-1})^{1/(2\beta-1)}$. We have $\rho < 1$ from $d < 1$, $d\alpha\log(1/d) \le (\log n)/100$, and $\beta \ge 4$,
since

\[
\rho^{2\beta-1} = d^{6-30d\alpha}n^{-1} \le d^{-30d\alpha}n^{-1} = e^{30d\alpha\log(1/d)}n^{-1} \le n^{3/10}\cdot n^{-1} < 1.
\]

By applying Lemma 3.2 with this value of $\rho$ to the graph $G[U]$, we get a subset $S$ of size $s := \lceil \rho^{\beta-1}u \rceil$
with at most $\rho s^2$ non-adjacent pairs. Note that if $s = 1$, then $S$ contains no non-adjacent pairs, and
if $s \ge 2$, then $S$ contains at most $\rho s^2 \le 4\rho^{2\beta-1}u^2$ non-adjacent pairs. In any case, $S$ has at most
$4\rho^{2\beta-1}u^2$ non-adjacent pairs. By definition, $\beta = \lfloor 8d\alpha \rfloor \ge 8d\alpha - 1$ and therefore $16d\alpha \le 2\beta + 2$.
Thus, the number of vertices in $S$ is at least

\[
s \ge \rho^{\beta-1}u \ge \frac{1}{50}d^4n^{\frac{1}{2}+\frac{1}{40d\alpha}},
\]

and the number of non-adjacent pairs in $S$ is at most

\[
\rho s^2 \le 4\rho^{2\beta-1}u^2 \le 4(d^{6-30d\alpha}n^{-1})(d^{30d\alpha}n^2) = 4d^6n.
\]

The vertices of $S$ form the vertices of a $K_s$-subdivision, where we use the edges between them as
paths, and for the pairs in $S$ that are not adjacent, we use paths of length 4 between them. We
can greedily pick these paths of length 4 as there are at most $4d^6n$ edges missing, and each pair of
vertices has $10^{-16}d^5n \ge 3\cdot 4d^6n$ internally vertex-disjoint paths of length 4 between them, where
we used the fact that $d \le 10^{-20}$. This completes the proof. □
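
A lower bound on the size of $S$ of the form required by the lemma can be derived as follows. This is a reconstruction under the assumptions $\rho^{2\beta-1} = d^{6-30d\alpha}n^{-1}$, $u \ge d^{15d\alpha}n$, $\beta \le 8d\alpha$ and $16d\alpha \le 2\beta+2$; the shorthand $\varepsilon = \frac{1}{2(2\beta-1)}$ is introduced here for illustration and does not appear in the proof.

```latex
% (\beta-1)/(2\beta-1) = 1/2 - \varepsilon
s \ge \rho^{\beta-1}u
  \ge \big(d^{6-30d\alpha}n^{-1}\big)^{\frac12-\varepsilon}\, d^{15d\alpha}\, n
  = d^{\,3+\varepsilon(30d\alpha-6)}\; n^{\frac12+\varepsilon}.
% exponent of d: 30 d\alpha - 6 <= 32 d\alpha - 6 <= 4\beta - 2 = 2(2\beta - 1), so
\varepsilon(30d\alpha-6) \le 1
\quad\Longrightarrow\quad d^{\,3+\varepsilon(30d\alpha-6)} \ge d^{4} \quad (0 < d < 1),
% exponent of n: 2\beta - 1 < 16 d\alpha, so
\varepsilon = \frac{1}{2(2\beta-1)} \ge \frac{1}{32d\alpha} \ge \frac{1}{40d\alpha},
\qquad\text{hence}\qquad
s \ge d^{4}\, n^{\frac12+\frac{1}{40d\alpha}}.
```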

Note that the lemma above is no longer true if we completely remove the restriction $\alpha \le n/2$. For
example, if $\alpha = n - 1$, then we can have $d = 1/\binom{n}{2}$, for which we have $d^4n^{\frac{1}{2}+\frac{1}{40d\alpha}} \gg n$. The
conclusion of the lemma is clearly impossible in this case since the total number of vertices is $n$.
The proof of Theorem 1.2 easily follows from the two lemmas above.

Proof of Theorem 1.2: Let $G$ be a graph with $n$ vertices, edge density $d$, and independence
number $\alpha$. Let $c = 10^{-20}$ and $c_1 = c_2 = c' = 10^{-114}$. If $n < 10^{14}c^{-5} = 10^{114}$, then we have
$\sigma(G) \ge 1 \ge c'n$. Thus we assume that $n \ge 10^{14}c^{-5}$.

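Case 1: $\alpha < 2\log n$.

If $d \ge c$, then by the first part of Lemma 4.1, we have $\sigma(G) \ge (10^{-6}c^{5/2})n^{\alpha/(2\alpha-1)} \ge c'n^{\alpha/(2\alpha-1)}$.

On the other hand, if $d < c$, then we have $d\alpha\log(1/d) \le (\log n)/1000$ and $\alpha < 2\log n \le n/2$. Lemma
4.2 therefore implies

\[
\sigma(G) \ge \frac{1}{50}d^4n^{\frac{1}{2}+\frac{1}{40d\alpha}}.
\]

From the inequality $d\alpha\log(1/d) \le (\log n)/1000$, we have $d^4n^{1/(80d\alpha)} = e^{-4\log(1/d)+(\log n)/(80d\alpha)} \ge
e^{-4\log(1/d)+(25/2)\log(1/d)} \ge 1$, and thus the above is at least

\[
\frac{1}{50}n^{\frac{1}{2}+\frac{1}{80d\alpha}} \ge \frac{1}{50}n^{\frac{1}{2}+\frac{1}{2\alpha}} \ge \frac{1}{50}n^{\frac{\alpha}{2\alpha-1}} \ge c'n^{\frac{\alpha}{2\alpha-1}}.
\]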
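
The comparison of exponents in Case 1 can be spelled out as follows. The sketch assumes $d\alpha\log(1/d) \le (\log n)/1000$, $d \le 1/40$ and $\alpha \ge 1$, all of which hold in this case.

```latex
\frac{\log n}{80\,d\alpha} \ge \frac{1000}{80}\,\log(1/d) = \frac{25}{2}\log(1/d)
\quad\Longrightarrow\quad
d^4 n^{\frac{1}{80d\alpha}} = e^{-4\log(1/d) + \frac{\log n}{80d\alpha}} \ge e^{\frac{17}{2}\log(1/d)} \ge 1,
% remaining factor n^{1/2 + 1/(80 d\alpha)}:
\frac{1}{80d\alpha} \ge \frac{1}{2\alpha} \ge \frac{1}{2(2\alpha-1)},
\qquad
n^{\frac12+\frac{1}{2(2\alpha-1)}} = n^{\frac{\alpha}{2\alpha-1}}.
```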
Case 2: $\alpha = a\log n$ for some $a \ge 2$.

If $d \ge \frac{c}{a\log a}$, then by the second part of Lemma 4.1, we have

\[
\sigma(G) \ge \sqrt{\frac{c}{600}}\sqrt{\frac{n}{a\log a}} \ge c'\sqrt{\frac{n}{a\log a}}.
\]

Thus we may assume that $d < \frac{c}{a\log a}$. If $\alpha > n/2$, then $a > n/(2\log n)$ and thus $c'\sqrt{n/(a\log a)} < 1$.
Therefore we trivially have $\sigma(G) \ge 1 \ge c'\sqrt{n/(a\log a)}$ in this case.

Otherwise, $\alpha \le n/2$ and $d < \frac{c}{a\log a} < c$ since $a \ge 2$. We also have $d\alpha\log(1/d) \le (\log n)/100$. Indeed,
as $t\log(1/t)$ is increasing for $t \le e^{-1}$, to verify this inequality one can substitute $d = \frac{c}{a\log a}$ and
$\alpha = a\log n$. By Lemma 4.2,

\[
\sigma(G) \ge \frac{1}{50}d^4n^{\frac{1}{2}+\frac{1}{40d\alpha}}.
\]

Note that as in Case 1 we have $d^4n^{1/(80d\alpha)} = e^{-4\log(1/d)+\log n/(80d\alpha)} \ge 1$, and thus the above bound is at
least

\[
\frac{1}{50}n^{\frac{1}{2}+\frac{1}{80d\alpha}} \ge \frac{n^{1/2}}{50} \ge c'\sqrt{\frac{n}{a\log a}}.
\]



5 Concluding Remarks 

In this paper we established the conjecture of Erdős and Fajtlowicz that $\frac{\chi(G)}{\sigma(G)} \le \frac{Cn^{1/2}}{\log n}$ for every
graph $G$ on $n$ vertices. The main part of the proof is Theorem 1.2, which gives a lower bound on
$f(n,\alpha)$, the minimum of $\sigma(G)$ over all graphs $G$ on $n$ vertices with $\alpha(G) \le \alpha$. It would be interesting
to determine the order of growth of $f(n,\alpha)$. As remarked in the introduction, determining $f(n,\alpha)$
is equivalent to the following Ramsey-type problem. Determine the minimum $n$ for which every
red-blue edge-coloring of $K_n$ contains a red subdivision of $K_s$ or a blue $K_{\alpha+1}$.

Theorem 1.2 and the remarks afterwards determine the order of growth of $f(n,\alpha)$ for $\alpha = 2$ and
$\alpha = \Theta(\log n)$. We conjecture that the lower bound $f(n,\alpha) \ge c_1n^{\frac{\alpha}{2\alpha-1}}$ is tight up to a constant factor
also for all $\alpha \le 2\log n$. As in the case $\alpha = 2$, to prove such a result it would be sufficient to find a
$K_{\alpha+1}$-free graph $G$ on $n$ vertices in which every subset of order $Cn^{\frac{\alpha}{2\alpha-1}}$ contains at least $n$ edges.
Then the complement of $G$ will have independence number at most $\alpha$ and no clique subdivision
of order $Cn^{\frac{\alpha}{2\alpha-1}}$ (the proof is as in the case $\alpha = 2$, see the introduction). A potential source of
such graphs $G$ are $(n,d,\lambda)$-graphs, introduced by Alon. An $(n,d,\lambda)$-graph is a $d$-regular graph on
$n$ vertices for which $\lambda$ is the second largest in absolute value eigenvalue of its adjacency matrix.
An $(n,d,\lambda)$-graph $G$ which is $K_{\alpha+1}$-free with $d = \Omega(n^{1-\frac{1}{2\alpha-1}})$ and $\lambda = O(\sqrt{d})$ would satisfy
the desired property by the expander mixing lemma (see, e.g., Theorem 2.11 in the survey [15]).
It is worth mentioning that this would be up to a constant factor the densest $(n,d,\lambda)$-graph with
$\lambda = O(\sqrt{d})$ which is $K_{\alpha+1}$-free (see, e.g., Theorem 4.10 of [15]). The construction of Alon [1] in the
case $\alpha = 2$ is the only known example of such graphs. We think it would be quite interesting to find
examples for larger $\alpha$, which would have other applications as well.

We make the following conjecture on the order of the largest clique subdivision which one can find
in a graph with chromatic number k. 

Conjecture 5.1 There is a constant $c > 0$ such that every graph $G$ with chromatic number $\chi(G) = k$
satisfies $\sigma(G) \ge c\sqrt{k\log k}$.

The bound in Conjecture 5.1 would be best possible by considering a random graph of order
$O(k\log k)$. Recall the result of Bollobás and Thomason [5] and Komlós and Szemerédi [13] which
says that every graph $G$ of average degree $d$ satisfies $\sigma(G) = \Omega(\sqrt{d})$. This is enough to imply the
bound $\sigma(G) = \Omega(\sqrt{k})$ for $G$ with $\chi(G) = k$, but not the extra logarithmic factor. Many of the
techniques developed in this paper to prove Theorem 1.1 are most useful in rather dense graphs.
These techniques do not appear sufficient to solve Conjecture 5.1, as in this conjecture one needs to
handle clique subdivisions in rather sparse graphs.

Acknowledgments. We would like to thank Noga Alon for helpful discussions. We would also like
to thank the two anonymous referees for their valuable comments.